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3 <110> APPLICANT: WALLAART, Thorvald Eelco 

4 BOUWMEESTER, Hendrik Jan 

6 <120> TITLE OF INVENTION: Transgenic Ainorpha-4, 11-Diene Synthesis 
8 <130> FILE REFERENCE: 702 010272 
10 <140> CURRENT APPLICATION NUMBER: 09/763, 822A 
C--> 11 <141> CURRENT FILING DATE: 2003-05-05 

13 <150> PRIOR APPLICATION NUMBER: PCT/EP99/06302 

14 <151> PRIOR FILING DATE: 1999-08-27 
16 <160> NUMBER OF SEQ ID NOS : 14 

18 <170> SOFTWARE: MS Word 97 SR-2 

20 <210> SEQ ID NO: 1 f*^ K I ^ P\ 

21 <211> LENGTH: 15 tN I tHtU 

22 <212> TYPE: DNA 

23 <213> ORGANISM: Artificial Sequence 

25 <220> FEATURE: 

26 <223> OTHER INFORMATION: EcoR I (Not I) adapter 

28 <400> SEQUENCE: 1 

29 gtcgacgcgg ccgcg 15 

31 <210> SEQ ID NO: 2 

32 <211> LENGTH: 19 

33 <212> TYPE: DNA 

34 <213> ORGANISM: Artificial Sequence 

36 <220> FEATURE: 

37 <223> OTHER INFORMATION: EcoR I (Not I) adapter 

39 <400> SEQUENCE: 2 

40 cagctgcgcc ggcgcttaa 19 

42 <210> SEQ ID NO: 3 

43 <211> LENGTH: 27 

44 <212> TYPE: DNA 

45 <213> ORGANISM: Artificial Sequence 

47 <220> FEATURE: 

48 <223> OTHER INFORMATION: Sense primer (primer C) used in PCR amplification 

50 <4 00> SEQUENCE: 3 

51 gtcgacaaac catggcactt acagaag 27 

53 <210> SEQ ID NO: 4 

54 <211> LENGTH: 32 

55 <212> TYPE: DNA 

56 <213> ORGANISM: Artificial Sequence 

58 <220> FEATURE: 

59 <223> OTHER INFORMATION: Antisense primer (primer D) used in PCR amplification 

61 <400> SEQUENCE: 4 

62 ggatggatcc tcatatactc ataggataaa eg 32 
64 <210> SEQ ID NO: 5 
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65 <211> LENGTH: 23 

66 <212> TYPE: DNA 

67 <213> ORGANISM: Artificial Sequence 

69 <220> FEATURE: 

70 <223> OTHER INFORMATION: Sense primer (primer G) used in PGR amplification 

72 <400> SEQUENCE: 5 

73 gaggatccat gtcacttaca gaa 23 

75 <210> SEQ ID NO: 6 

76 <211> LENGTH: 24 

77 <212> TYPE: DNA 

78 <213> ORGANISM: Artificial Sequence 

80 <220> FEATURE: 

81 <223> OTHER INFORMATION: Antisense primer (primer H) used in PGR amplification 

83 <400> SEQUENCE: 6 

84 atggatcctc atatactcat agga 24 

86 <210> SEQ ID NO: 7 

87 <211> LENGTH: 22 

88 <212> TYPE: DNA 

89 <213> ORGANISM: Artificial Sequence 

91 <220> FEATURE: 

92 <223> OTHER INFORMATION: Sense primer {primer E) used in PGR amplification 

94 <400> SEQUENCE: 7 

95 cgagaattca tgtcacttac ag 22 

97 <210> SEQ ID NO: 8 

98 <211> LENGTH: 22 

99 <212> TYPE: DNA 

100 <213> ORGANISM: Artificial Sequence 

102 <220> FEATURE: 

103 <223> OTHER INFORMATION: Antisense primer (primer F) used in PGR amplification 

105 <400> SEQUENCE: 8 

106 ggatctcgag tcatatactc at 22 

108 <210> SEQ ID NO: 9 

109 <211> LENGTH: 538 

110 <212> TYPE: DNA 

111 <213> ORGANISM: Artificial Sequence 

113 <220> FEATURE: 

114 <223> OTHER INFORMATION: Nucleotide sequence of probe generated by PGR with primers A 



115 and B 

117 <400> SEQUENCE: 9 

118 gatgagaatg ggaaatttaa ggaatcgtta gctaatgatg ttgaaggttt gcttgagttg 60 
120 tacgaagcaa cttctatgag ggtacctggg gagattatat tagaagatgc tcttggtttt 120 
122 acacgatctc gtcttagcat tatgacaaaa gatgcttttt ctacaaaccc cgctcttttt 180 
124 accgaaatac aacgggcact aaagcaaccc ctttggaaaa ggttgccaag aatagaggcg 240 
126 gcgcagtaca ttcctttcta tcaacaacaa gattctcata acaagacttt acttaaactt 300 
128 gctaagttag agttcaattt gcttcagtca ttgcacaagg aagagctcag ccatgtgtgc 360 
130 aaatggtgga aagctttcga tatcaagaag aacgcacctt gtttaagaga tagaattgtt 420 
132 gaatgctact tttggggact aggttcaggc tatgagccac agtattcccg ggctagagtt 480 
134 ttcttcacaa aagctgttgc tgttataact cttatagacg acaccttcga cgctacgg 538 
136 <210> SEQ ID NO: 10 
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137 <211> LENGTH: 179 

138 <212> TYPE: PRT 

139 <213> ORGANISM: Artificial Sequence 

141 <220> FEATURE: 

142 <223> OTHER INFORMATION: Deduced amino acid sequence of probe generated by PGR with 

143 primers A and B 
145 <400> SEQUENCE: 10 



146 


Asp 


Glu 


Asn 


Gly 


Lys 


Phe 


Lys 


Glu 


Ser 


Leu 


Ala 


Asn 


Asp 


Val 


Glu 


Gly 


147 


1 








5 










10 










15 




148 


Leu 


Leu 


Glu 


Leu 


Tyr 


Glu 


Ala 


Thr 


Ser 


Met 


Arg 


Val 


Pro 


Gly 


Glu 


He 


149 








20 










25 










30 






150 


He 


Leu 


Glu 


Asp 


Ala 


Leu 


Gly 


Phe 


Thr 


Arg 


Ser 


Arg 


Leu 


Ser 


He 


Met 


151 






35 










40 










45 








152 


Thr 


Lys 


Asp 


Ala 


Phe 


Ser 


Thr 


Asn 


Pro 


Ala 


Leu 


Phe 


Thr 


Glu 


He 


Gin 


153 




50 










55 










60 










154 


Arg 


Ala 


Leu 


Lys 


Gin 


Pro 


Leu 


Trp 


Lys 


Arg 


Leu 


Pro 


Arg 


He 


Glu 


Ala 


155 


65 










70 










75 










80 


156 


Ala 


Gin 


Tyr 


He 


Pro 


Phe 


Tyr 


Gin 


Gin 


Gin 


Asp 


Ser 


His 


Asn 


Lys 


Thr 


157 










85 










90 










95 




158 


Leu 


Leu 


Lys 


Leu 


Ala 


Lys 


Leu 


Glu 


Phe 


Asn 


Leu 


Leu 


Gin 


Ser 


Leu 


His 


159 








100 










105 










110 






160 


Lys 


Glu 


Glu 


Leu 


Ser 


His 


Val 


Cys 


Lys 


Trp 


Trp 


Lys 


Ala 


Phe 


Asp 


He 


161 






115 










120 










125 








162 


Lys 


Lys 


Asn 


Ala 


Pro 


Cys 


Leu 


Arg 


Asp Arg 


He 


Val 


Glu 


Cys 


Tyr 


Phe 


163 




130 










135 










140 










164 


Trp 


Gly 


Leu 


Gly 


Ser 


Gly 


Tyr 


Glu 


Pro 


Gin 


Tyr 


Ser 


Arg 


Ala 


Arg 


Val 


165 


145 










150 










155 










160 


166 


Phe 


Phe 


Thr 


Lys 


Ala 


Val 


Ala 


Val 


He 


Thr 


Leu 


He 


Asp 


Asp 


Thr 


Phe 


167 










165 










170 










175 





168 
170 
171 
172 
173 
175 
176 
synthase 
177 
179 
180 
182 
184 
186 
188 
190 
192 
194 
196 
198 
200 



Asp Ala Thr 
<210> SEQ ID NO: 11 
<211> LENGTH: 2112 
<212> TYPE: DNA 

<213> ORGANISM: Artemisia annua L. 
<220> FEATURE: 

<223> OTHER INFORMATION: Nucleotide sequence of a positive clone (amorphadiene 



encoding gene) 
<400> SEQUENCE: 11 
aattcgcggc cgcgtcgaca 
gccaactttc ctccaagcat 
caaggggtgg aacagatagt 
gctttggata ttcctatgaa 
cttggaatac cgtatcactt 
acatatggtg ataactggaa 
caaggatatt atgttacatg 
aagcaatcgt tagctaatga 
agggtacctg gggagattat 
attatgacaa aagatgcttt 
ctaaagcaac ccctttggaa 



isolated from the cDNA 



aatcatgtca 
ttggggagat 
gaatgattta 
acatgccaat 
tgaacgggag 
tggtgaccgc 
tgatgttttc 
tgttgaaggt 
attagaagat 
ttctacaaac 
aaggttgcca 



cttacagaag 
cagtttctca 
aaaaaagaag 
ttgttgaagc 
attgatcatg 
tcttccttat 
aataactata 
ttgcttgagt 
gctcttggtt 
cccgctcttt 
agaatagagg 



library of 


induced A. annua 


aaaaacctat 


tcgccccatt 


60 


tctatcaaaa 


gcaagtagag 


120 


tgcggcaact 


actaaaagaa 


180 


tgattgatga 


aattcaacgc 


240 


cattgcaatg 


tatttatgaa 


300 


ggttccgtct 


tatgcgaaag 


360 


aagacaaaaa 


tggagcgttc 


420 


tgtacgaagc 


aacttctatg 


480 


ttacacgatc 


tcgtcttagc 


540 


ttaccgaaat 


acaacgggca 


600 


cggcgcagta 


cattcctttc 


660 
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202 tatcaacaac aagattctca taacaagact ttacttaaac ttgctaagtt agagttcaat 
204 ttgcttcagt cattgcacaa ggaagagctc agccatgtgt gcaaatggtg gaaagctttc 
206 gatatcaaga agaacgcacc ttgtttaaga gatagaattg ttgaatgcta cttttgggga 
208 ctaggttcag gctatgagcc acagtattcc cgggctagag ttttcttcac aaaagctgtt 
210 gctgttataa ctcttataga tgacacttat gatgcgtatg gtacttatga agaacttaag 
212 atctttactg aagctgttga aaggtggtca attacatgct tagacacact tccagaatac 
214 atgaaaccga tatacaaatt attcatggat acatacacag aaatggaaga atttcttgca 
216 aaggagggaa gaacagatct atttaactgc ggcaaagaat ttgtgaaaga gtttgttaga 
218 aacctgatgg ttgaagcaaa atgggcaaat gagggacaca taccaaccac tgaagagcat 
220 gatccagttg taatcattac tggcggtgct aacctgctta caacaacttg ttatcttggc 
222 atgagtgata tattcacaaa agagtctgtc gaatgggctg tctctgcacc tcctcttttt 
224 agatactcag gtatacttgg tcgacgccta aatgatctca tgacccacaa ggccgagcaa 
226 gaaagaaaac atagttcatc gagccttgaa agttatatga aggaatataa tgtcaatgag 
228 gagtatgccc aaaccttgat ttacaaggaa gtagaagatg tgtggaaaga tataaaccga 
230 gagtacctca caactaaaaa cattccaagg ccgttattga tggctgtgat ctatttgtgc 
232 cagtttcttg aagttcaata tgcaggaaag gataacttca cacgtatggg agacgaatac 
234 aaacatctca taaagtctct actcgtttat cctatgagta tatgactacc aatccttcgt 
236 gcatagccta tcaattatat tgaaagggtt aactatgcac gtctctatgg agagaatttc 
238 tcaagctatt tggtgtttct tgctggcaat aataaatcag acgcataaaa ttgtattgaa 
240 ctatatgccg atagctattt aaagttatta tacaactaaa atattcaaca atggtattat 
242 acttttactt tgtacaaaag caaaagtaca ctactgttat gtaacatttt agttctatga 
244 tactttagtt acgaatcggc ttatatacat tgatacactt ttatgcagaa aaccctagta 
24 6 aataaaaagt cgatatcttg tactacacat atcgcacgaa tttccgtttg ccgtttgtat 
248 tttacgatat gttatttaat gaatatgttt catgtggttg ttgcttaaaa aaaaagtcga 
250 cgcggccgcg aa 

252 <210> SEQ ID NO: 12 

253 <211> LENGTH: 697 

254 <212> TYPE: PRT 

255 <213> ORGANISM: Artemisia annua L. 

257 <220> FEATURE: 

258 <223> OTHER INFORMATION: Deduced amino acid 
(amorphadiene 

259 synthase encoding gene) isolated from 

260 induced A. annua 

262 <400> SEQUENCE: 12 

263 Asn Ser Arg Pro Arg Arg Gin lie Met Ser Leu Thr Glu Glu Lys Pro 

264 15 10 15 
2 65 lie Arg Pro lie Ala Asn Phe Pro Pro Ser lie Trp Gly Asp Gin Phe 

266 20 25 30 

267 Leu lie Tyr Gin Lys Gin Val Glu Gin Gly Val Glu Gin lie Val Asn 

268 35 40 45 

269 Asp Leu Lys Lys Glu Val Arg Gin Leu Leu Lys Glu Ala Leu Asp lie 

270 50 55 60 

271 Pro Met Lys His Ala Asn Leu Leu Lys Leu lie Asp Glu lie Gin Arg 

272 65 70 75 80 

273 Leu Gly He Pro Tyr His Phe Glu Arg Glu He Asp His Ala Leu Gin 

274 85 90 95 

275 Cys He Tyr Glu Thr Tyr Gly Asp Asn Trp Asn Gly Asp Arg Ser Ser 

276 100 105 110 

277 Leu Trp Phe Arg Leu Met Arg Lys Gin Gly Tyr Tyr Val Thr Cys Asp 



720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2112 



sequence of a positive clone 
the cDNA library of 



file://C:\CRF4\Outhold\VsrI763822A.htm 



RAW SEQUENCE LISTING DATE: 07/17/2003 

PATENT APPLICATION: US/09/763 , 822A TIME: 11:32:33 



Input Set : A:\W0008463.txt 

Output Set: N:\CRF4\07172003\I763822A.raw 



278 






115 










120 










125 








279 


Val 


Phe 


Asn 


Asn 


i y X 


Lys 


Asp 


Lys 


Asn 


Gly 


Ala 


Phe 


Lys 


Gin 


Ser 


Leu 


280 




130 










135 










140 










281 


Ala 


Asn 


Asp 


Val 


Glu 


Gly 


Leu 


Le u 


Glu 


Leu 


T vr 
1 yx 


Glu 


Ala 


Thr 


Ser 


Met 


282 


145 










150 










155 










160 






V ct X 




ljx y 


ox LI 


X X t; 


T 1 A 

X X 


X/eu 


wX U 


A cr^ 


rtx d 


ItSU 


kjx y 


r i It; 


Thr 

i I IX 


rix y 


284 










165 










170 










175 




9 ft 

/COO 






Leu 




X X t: 




1 1 IX 


Lys 




Al 

rtX d 


r lie 


OcX 


Th r- 

1 J iX 


A c^n 


r X U 


Al p 

r\X- d 


^ 0 o 








1 ft n 

X 0 VJ 










1 ft 
X O J 










1 Qn 

X _? u 






9 ft 7 


Lsu 


XT lit; 


i Ilx 


oX U 


X X fc; 


^ciXXi 


Arg 


A 1 


Leu 


Lys 




r X O 


Leu 


irp 


Lys 


riX y 


288 






195 










200 










205 








9R Q 
z. 0 


Leu 




Zi vrr 
riX y 


X xe 


V3X U 


riXd 


rixa 


P 1 n 
vpXI 1 


iyr 


T 1 a 
X Xfc: 


r X U 


r life; 


iyr 


Pin 
o Xl 1 


PI n 

'JJ X i 1 


r;i n 

O X 1 1 


9 QO 




91 n 

Z X u 










91 S 

Z X D 










9 90 

z z u 










9 QT 


/iSp 


O t!x 


Hie 
nx o 




Lys 


1 fix 


Leu 


Leu 


Lys 


Leu 


A 1 PI 
rlXd 


Lys 


Leu 


PI n 
oX U 


Ph^a 
r 1 ifc: 




9 Q9 


9 9 R 
Z Z O 










9 "^n 

Z ^ U 










Z -Z> o 










9 4 0 
Z 4 u 


9 Q'^ 

z yj? 


Leu 


Leu 


uxn 


oer 


Leu 


nl S 


Lys 


PI 11 

oXU 


Pin 
oXU 


Leu 


Q ci V- 

oer 


nl S 


\7=i 1 
VdX 


uys 


Lys 


irp 


z y ^ 










Z 4 0 










Z 0 u 










9 s ^ 
zoo 




9 Q 
Z I* 0 


Trp 


Lys 


Ria 


rne 


Asp 


X xe 


Lys 


Lys 


Asn 


/\Xd 


Pro 


Cys 


Leu 


Arg 


Asp 


Arg 


9 O 
Z 3? O 








9 f^n 

Z DU 










9 ^^ 

Z DO 










97 n 
z / u 






9 Q7 

z y / 


T "J 


V ax 


oxU 


Cys 


Tyr 


rne 


Trp 


P 1 ^7 

taxy 


Leu 


P 1 

Vjxy 


oer 


PI 

(jxy 


Tyr 


Pin 

oXU 


Pro 


P 1 m 

oxn 


z yy 






O 7 c; 
Z / 3 










o D n 
zo U 










9 Q R 








9 00 


Tyr 


SejT 


Arg 


/ix a 


Arg 


V ax 


rne 


rne 


i nr 


Lys 


/ixa 


vax 


/ix a 


vax 


T 1 o 

X xe 


Th T 

I nr 


no 




9 on 










Z y D 




















Jul 


Leu 


lie 


Asp 


Asp 


Thr 


Tyr 


Asp 


Ala 


Tyr 


uxy 


Thr 


Tyr 


Pin 

LjXU 


Pin 

uxu 


Leu 


Lys 


'^09 
JjUZ 


J U D 




















J)X D 










JjZ U 


J U -5 


1 xe 


rue 


i nr 


Pill 

oX u 


i\xa 


V ax 


Pin 

oXU 


Arg 


Trp 


oer 


X xe 


1 nr 


cys 


Leu 


Asp 


Thr^ 

i nr 


J U 4 










"^9 s 
J z o 




















O -J o 






Leu 


r xO 


oXU 


iyr 


Lxie L 


Lys 


P V/^ 

r X O 


He 


Tyr 


Lys 


Leu 


Phci 

rne 


riex 


Asp 


Thr- 

1 nr 


T\ rT- 

iyr 


J u o 








4 n 

O 4 u 










345 










o o u 








I LIT 


tjlU 


ixtet 


Pin 


PI n 
toXU 


rne 


Leu 


Ala 


Lys 


Pin 

taXU 


PI \r 

(axy 


Arg 


Th T- 

i nr 


Asp 


Leu 


Ph£i 

rne 


CtKJO 






R "=1 
J) o □ 










360 


















JJ U j3 


Asn 


uy s 


oi y 


Lys 


PI n 
oXU 


PViCi 

rne 


vax 


Lys 


Glu 


P"h ci 

rne 


vax 


Arg 


Asn 


Leu 


iYiei. 


\7=i 1 

vax 


310 




370 










37 5 










380 










O X -L 






Lys 


i rp 


^X d 


A o r-\ 


P 1 n 

Vj X U. 


Gly 


His 


X Xt; 


P r"o 
r X U 


Thr- 
1 I IX 


Thr- 

1 i iX 




Pin 
VjX U. 


n X o 


312 


385 










390 










395 










400 


o 1 o 


Asp 


r iro 


V ai 


V ax 


X xe 


X xe 


xxir 


Gly Gly 


Hxa 


Asn 


Leu 


Leu 


Th r- 

i nr 


Th r- 

i nr 


Thr- 

i nr 


314 










405 










410 










415 




315 


Cys 


Tyr 


Leu 


Gly 


Met 


Ser 


Asp 


He 


Phe 


Thr 


Lys 


Glu 


Ser 


Val 


Glu 


Trp 


316 








420 










425 










430 






317 


Ala 


Val 


Ser 


Ala 


Pro 


Pro 


Leu 


Phe 


Arg 


Tyr 


Ser 


Gly 


He 


Leu 


Gly 


Arg 


318 






435 










440 










445 








319 


Arg 


Leu 


Asn 


Asp 


Leu 


Met 


Thr 


His 


Lys 


Ala 


Glu 


Gin 


Glu 


Arg 


Lys 


His 


320 




450 










455 










460 










321 


Ser 


Ser 


Ser 


Ser 


Leu 


Glu 


Ser 


Tyr 


Met 


Lys 


Glu 


Tyr 


Asn 


Val 


Asn 


Glu 


322 


465 










470 










475 










480 


323 


Glu 


Tyr 


Ala 


Gin 


Thr 


Leu 


He 


Tyr 


Lys 


Glu 


Val 


Glu 


Asp 


Val 


Trp 


Lys 


324 










485 










490 










495 




325 


Asp 


He 


Asn 


Arg 


Glu 


Tyr 


Leu 


Thr 


Thr 


Lys 


Asn 


He 


Pro 


Arg 


Pro 


Leu 


326 








500 










505 










510 
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SEQUENCE LISTING 



<110> WALLAART, Thorvald Eelco 
BOUWMEESTER, Hendrik Jan 

<120> Transgenic Amorpha-4, 11-Diene Synthesis 

<130> 702 010272 

<140> 09/763, 822 
<141> 2001-02-26 

<150> PCT/EP99/06302 
<151> 1999-08-27 

<160> 14 

<170> MS Word 97 SR-2 

<210> 1 
<211> 15 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> EcoR I (Not I) adapter 
<400> 1 

gtcgacgcgg ccgcg 15 

<210> 2 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> EcoR I (Not I) adapter 
<400> 2 

cagctgcgcc ggcgcttaa 19 

<210> 3 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Sense primer (primer C) used in PGR amplification 
<400> 3 

gtcgacaaac catggcactt acagaag 27 

<210> 4 
<211> 32 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Antisense primer (primer D) used in PGR amplification 
<400> 4 

ggatggatcc tcatatactc ataggataaa eg 

<210> 5 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Sense primer (primer G) used in PGR amplification 
<400> 5 

gaggatccat gtcacttaca gaa 

<210> 6 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Antisense primer (primer H) used in PGR amplification 
<400> 6 

atggatcctc atatactcat agga 

<210> 7 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Sense primer (primer E} used in PGR amplification 
<400> 7 

cgagaattca tgtcacttac ag 

<210> 8 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Antisense primer (primer F) used in PGR amplification 
<400> 8 

ggatctcgag tcatatactc at 

<210> 9 
<211> 538 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Nucleotide sequence of probe generated by PGR with primers A 



and B 
<400> 9 



gatgagaatg 


ggaaatttaa 


ggaatcgtta 


gctaatgatg 


ttgaaggttt 


gcttgagttg 


60 


tacgaagcaa 


cttctatgag 


ggtacctggg 


gagattatat 


tagaagatgc 


tcttggtttt 


120 


acacgatctc 


gtcttagcat 


tatgacaaaa 


gatgcttttt 


ctacaaaccc 


cgctcttttt 


180 


accgaaatac 


aacgggcact 


aaagcaaccc 


ctttggaaaa 


ggttgccaag 


aatagaggcg 


240 


gcgcagtaca 


ttcctttcta 


tcaacaacaa 


gattctcata 


acaagacttt 


acttaaactt 


300 


gctaagttag 


agttcaattt 


gcttcagtca 


ttgcacaagg 


aagagctcag 


ccatgtgtgc 


360 


aaatggtgga 


aagctttcga 


tatcaagaag 


aacgcacctt 


gtttaagaga 


tagaattgtt 


420 


gaatgctact 


tttggggact 


aggttcaggc 


tatgagccac 


agtattcccg 


ggctagagtt 


480 


ttcttcacaa 


aagctgttgc 


tgttataact 


cttatagacg 


acaccttcga 


cgctacgg 


538 



<210> 10 
<211> 179 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Deduced amino acid sequence of probe generated by PGR with 
primers A and B 



<400> 10 



Asp 


Glu 


Asn 


Gly 


Lys 


Phe 


Lys 


Glu 


Ser 


Leu 


Ala 


Asn 


Asp 


Val 


Glu 


Gly 


1 








5 










10 










15 




Leu 


Leu 


Glu 


Leu 
20 


Tyr 


Glu 


Ala 


Thr 


Ser 
25 


Met 


Arg 


Val 


Pro 


Gly 
30 


Glu 


He 


He 


Leu 


Glu 
35 


Asp 


Ala 


Leu 


Gly 


Phe 
40 


Thr 


Arg 


Ser 


Arg 


Leu 
45 


Ser 


He 


Met 


Thr 


Lys 
50 


Asp 


Ala 


Phe 


Ser 


Thr 
55 


Asn 


Pro 


Ala 


Leu 


Phe 
60 


Thr 


Glu 


He 


Gin 


Arg 


Ala 


Leu 


Lys 


Gin 


Pro 


Leu 


Trp 


Lys 


Arg 


Leu 


Pro 


Arg 


He 


Glu 


Ala 


65 










70 










75 










80 


Ala 


Gin 


Tyr 


He 


Pro 
85 


Phe 


Tyr 


Gin 


Gin 


Gin 
90 


Asp 


Ser 


His 


Asn 


Lys 
95 


Thr 


Leu 


Leu 


Lys 


Leu 
100 


Ala 


Lys 


Leu 


Glu 


Phe 
105 


Asn 


Leu 


Leu 


Gin 


Ser 
110 


Leu 


His 


Lys 


Glu 


Glu 
115 


Leu 


Ser 


His 


Val 


Cys 
120 


Lys 


Trp 


Trp 


Lys 


Ala 
125 


Phe 


Asp 


He 


Lys 


Lys 
130 


Asn 


Ala 


Pro 


Cys 


Leu 
135 


Arg 


Asp 


Arg 


He 


Val 
140 


Glu 


Cys 


Tyr 


Phe 


Trp 


Gly 


Leu 


Gly 


Ser 


Gly 


Tyr 


Glu 


Pro 


Gin 


Tyr 


Ser 


Arg 


Ala 


Arg 


Val 


145 










150 










155 










160 


Phe 


Phe 


Thr 


Lys 


Ala 
165 


Val 


Ala 


Val 


He 


Thr 
170 


Leu 


He 


Asp 


Asp 


Thr 
175 


Phe 



Asp Ala Thr 



<210> 11 
<211> 2112 



<212> DNA 

<213> Artemisia annua L. 
<220> 

<223> Nucleotide sequence of a positive clone ( amorphadiene synthase 

encoding gene) isolated from the cDNA library of induced A. annua 

<400> 11 



aattcgcggc 


cgcgtcgaca 


aatcatgtca 


cttacagaag 


aaaaacctat 


tcgccccatt 


60 


gccaactttc 


ctccaagcat 


ttggggagat 


cagtttctca 


tctatcaaaa 


gcaagtagag 


120 


caaggggtgg 


aacagatagt 


gaatgattta 


aaaaaagaag 


tgcggcaact 


actaaaagaa 


180 


gctttggata 


ttcctatgaa 


acatgccaat 


ttgttgaagc 


tgattgatga 


aattcaacgc 


240 


cttggaatac 


cgtatcactt 


tgaacgggag 


attgatcatg 


cattgcaatg 


tatttatgaa 


300 


acatatggtg 


ataactggaa 


tggtgaccgc 


tcttccttat 


ggttccgtct 


tatgcgaaag 


360 


caaggatatt 


atgttacatg 


tgatgttttc 


aataactata 


aagacaaaaa 


tggagcgttc 


420 


aagcaatcgt 


tagctaatga 


tgttgaaggt 


ttgcttgagt 


tgtacgaagc 


aacttctatg 


480 


agggtacctg 


gggagattat 


attagaagat 


gctcttggtt 


ttacacgatc 


tcgtcttagc 


540 


attatgacaa 


aagatgcttt 


ttctacaaac 


cccgctcttt 


ttaccgaaat 


acaacgggca 


600 


ctaaagcaac 


ccctttggaa 


aaggttgcca 


agaatagagg 


cggcgcagta 


cattcctttc 


660 


tatcaacaac 


aagattctca 


taacaagact 


ttacttaaac 


ttgctaagtt 


agagttcaat 


720 


ttgcttcagt 


cattgcacaa 


ggaagagctc 


agccatgtgt 


gcaaatggtg 


gaaagctttc 


780 


gatatcaaga 


agaacgcacc 


ttgtttaaga 


gatagaattg 


ttgaatgcta 


cttttgggga 


840 


ctaggttcag 


gctatgagcc 


acagtattcc 


cgggctagag 


ttttcttcac 


aaaagctgtt 


900 


gctgttataa 


ctcttataga 


tgacacttat 


gatgcgtatg 


gtacttatga 


agaacttaag 


960 


atctttactg 


aagctgttga 


aaggtggtca 


attacatgct 


tagacacact 


tccagaatac 


1020 


atgaaaccga 


tatacaaatt 


attcatggat 


acatacacag 


aaatggaaga 


atttcttgca 


1080 


aaggagggaa 


gaacagatct 


atttaactgc 


ggcaaagaat 


ttgtgaaaga 


gtttgttaga 


1140 


aacctgatgg 


ttgaagcaaa 


atgggcaaat 


gagggacaca 


taccaaccac 


tgaagagcat 


1200 


gatccagttg 


taatcattac 


tggcggtgct 


aacctgctta 


caacaacttg 


ttatcttggc 


1260 


atgagtgata 


tattcacaaa 


agagtctgtc 


gaatgggctg 


tctctgcacc 


tcctcttttt 


1320 


agatactcag 


gtatacttgg 


tcgacgccta 


aatgatctca 


tgacccacaa 


ggccgagcaa 


1380 


gaaagaaaac 


atagttcatc 


gagccttgaa 


agttatatga 


aggaatataa 


tgtcaatgag 


1440 


gagtatgccc 


aaaccttgat 


ttacaaggaa 


gtagaagatg 


tgtggaaaga 


tataaaccga 


1500 



gagtacctca caactaaaaa cattccaagg ccgttattga tggctgtgat ctatttgtgc 1560 

cagtttcttg aagttcaata tgcaggaaag gataacttca cacgtatggg agacgaatac 1620 

aaacatctca taaagtctct actcgtttat cctatgagta tatgactacc aatccttcgt 1680 

gcatagccta tcaattatat tgaaagggtt aactatgcac gtctctatgg agagaatttc 1740 

tcaagctatt tggtgtttct tgctggcaat aataaatcag acgcataaaa ttgtattgaa 1800 

ctatatgccg atagctattt aaagttatta tacaactaaa atattcaaca atggtattat 18 60 

acttttactt tgtacaaaag caaaagtaca ctactgttat gtaacatttt agttctatga 1920 

tactttagtt acgaatcggc ttatatacat tgatacactt ttatgcagaa aaccctagta 1980 

aataaaaagt cgatatcttg tactacacat atcgcacgaa tttccgtttg ccgtttgtat 2040 

tttacgatat gttatttaat gaatatgttt catgtggttg ttgcttaaaa aaaaagtcga 2100 

cgcggccgcg aa 2112 

<210> 12 
<211> 697 
<212> PRT 

<213> Artemisia annua L. 
<220> 

<223> Deduced amino acid sequence of a positive clone ( amorphadiene 
synthase encoding gene) isolated from the cDNA library of 
induced A. annua 



<400> 12 



Asn 


Ser 


Arg 


Pro 


Arg 


Arg 


Gin 


He 


Met 


Ser 


Leu 


Thr 


Glu 


Glu 


Lys 


Pro 


1 








5 










10 










15 




He 


Arg 


Pro 


He 
20 


Ala 


Asn 


Phe 


Pro 


Pro 
25 


Ser 


He 


Trp 


Gly 


Asp 
30 


Gin 


Phe 


Leu 


He 


Tyr 
35 


Gin 


Lys 


Gin 


Val 


Glu 
40 


Gin 


Gly 


Val 


Glu 


Gin 
45 


He 


Val 


Asn 


Asp 


Leu 
50 


Lys 


Lys 


Glu 


Val 


Arg 
55 


Gin 


Leu 


Leu 


Lys 


Glu 
60 


Ala 


Leu 


Asp 


He 


Pro 


Met 


Lys 


His 


Ala 


Asn 


Leu 


Leu 


Lys 


Leu 


He 


Asp 


Glu 


He 


Gin 


Arg 


65 










70 










75 










80 


Leu 


Gly 


He 


Pro 


Tyr 
85 


His 


Phe 


Glu 


Arg 


Glu 
90 


He 


Asp 


His 


Ala 


Leu 
95 


Gin 


Cys 


He 


Tyr 


Glu 
100 


Thr 


Tyr 


Gly 


Asp 


Asn 
105 


Trp 


Asn 


Gly 


Asp 


Arg 
110 


Ser 


Ser 


Leu 


Trp 


Phe 
115 


Arg 


Leu 


Met 


Arg 


Lys 
120 


Gin 


Gly 


Tyr 


Tyr 


Val 
125 


Thr 


Cys 


Asp 


Val 


Phe 
130 


Asn 


Asn 


Tyr 


Lys 


Asp 
135 


Lys 


Asn 


Gly 


Ala 


Phe 
140 


Lys 


Gin 


Ser 


Leu 


Ala 


Asn 


Asp 


Val 


Glu 


Gly 


Leu 


Leu 


Glu 


Leu 


Tyr 


Glu 


Ala 


Thr 


Ser 


Met 


145 










150 










155 










160 


Arg 


Val 


Pro 


Gly 


Glu 
165 


He 


He 


Leu 


Glu 


Asp 
170 


Ala 


Leu 


Gly 


Phe 


Thr 
175 


Arg 


Ser 


Arg 


Leu 


Ser 


He 


Met 


Thr 


Lys 


Asp 


Ala 


Phe 


Ser 


Thr 


Asn 


Pro 


Ala 



Leu 

Leu 

Asp 
225 
Leu 

Trp 

He 

Tyr 

Leu 
305 
He 

Leu 

Thr 

Asn 

Glu 
385 
Asp 

Cys 

Ala 

Arg 

Ser 
465 
Glu 

Asp 

Leu 

Gly 

Lys 
545 
Pro 

Asn 

Ala 

Asn 

Ser 
625 



Phe 

Pro 
210 
Ser 

Leu 

Lys 

Val 

Ser 
290 
He 

Phe 

Pro 

Glu 

Cys 
370 
Ala 

Pro 

Tyr 

Val 

Leu 
450 
Ser 

Tyr 

He 

Met 

Lys 
530 
Ser 

He 

Phe 

Asn 

Asn 
610 
Thr 



Thr 
195 
Arg 

His 

Gin 

Ala 

Glu 
275 
Arg 

Asp 

Thr 

Glu 

Met 
355 
Gly 

Lys 

Val 

Leu 

Ser 
435 
Asn 

Ser 

Ala 

Asn 

Ala 
515 
Asp 

Leu 

Asn 

Ser 

Cys 
595 
He 

Leu 



180 
Glu 

He 

Asn 

Ser 

Phe 
260 
Cys 

Ala 

Asp 

Glu 

Tyr 
340 
Glu 

Lys 

Trp 

Val 

Gly 
420 
Ala 

Asp 

Ser 

Gin 

Arg 
500 
Val 

Asn 

Leu 

Tyr 

Ser 
580 
He 

Gin 

Leu 



He 

Glu 

Lys 

Leu 
245 
Asp 

Tyr 

Arg 

Thr 

Ala 
325 
Met 

Glu 

Glu 

Ala 

He 
405 
Met 

Pro 

Leu 

Leu 

Thr 
485 
Glu 

He 

Phe 

Val 

He 
565 
Tyr 

Glu 

Gin 

Leu 



Gin 

Ala 

Thr 
230 
His 

He 

Phe 

Val 

Tyr 
310 
Val 

Lys 

Phe 

Phe 

Asn 
390 
He 

Ser 

Pro 

Met 

Glu 
470 
Leu 

Tyr 

Tyr 

Thr 

Tyr 
550 
Glu 

Leu 

Leu 

Trp 

Cys 
630 



Arg Ala 
200 
Ala Gin 
215 

Leu Leu 

Lys Glu 

Lys Lys 

Trp Gly 
280 
Phe Phe 
295 

Asp Ala 

Glu Arg 

Pro He 

Leu Ala 
360 
Val Lys 
375 

Glu Gly 

Thr Gly 

Asp He 

Leu Phe 
440 
Thr His 
455 

Ser Tyr 

He Tyr 

Leu Thr 

Leu Cys 
520 
Arg Met 
535 

Pro Met 

Arg Val 

Val Phe 

Tyr Ala 
600 
Tyr Tyr 
615 

Asn He 



185 

Leu Lys 

Tyr He 

Lys Leu 

Glu Leu 
250 
Asn Ala 
265 

Leu Gly 

Thr Lys 

Tyr Gly 

Trp Ser 
330 
Tyr Lys 
345 

Lys Glu 

Glu Phe 

His He 

Gly Ala 
410 
Phe Thr 
425 

Arg Tyr 

Lys Ala 

Met Lys 

Lys Glu 
490 
Thr Lys 
505 

Gin Phe 

Gly Asp 

Ser He 

Asn Tyr 
570 
Leu Ala 
585 

Asp Ser 
Thr Phe 
Leu Val 



Gin 

Pro 

Ala 
235 
Ser 

Pro 

Ser 

Ala 

Thr 
315 
He 

Leu 

Gly 

Val 

Pro 
395 
Asn 

Lys 

Ser 

Glu 

Glu 
475 
Val 

Asn 

Leu 

Glu 

Leu 
555 
Ala 

Gly 

Tyr 

Thr 

Leu 
635 



Pro 

Phe 
220 
Lys 

His 

Cys 

Gly 

Val 
300 
Tyr 

Thr 

Phe 

Arg 

Arg 
380 
Thr 

Leu 

Glu 

Gly 

Gin 
460 
Tyr 

Glu 

He 

Glu 

Tyr 
540 
Pro 

Arg 

Asn 

Leu 

Leu 
620 
Tyr 



190 
Leu Trp 
205 

Tyr Gin 

Leu Glu 

Val Cys 

Leu Arg 
270 
Tyr Glu 
285 

Ala Val 

Glu Glu 

Cys Leu 

Met Asp 
350 
Thr Asp 
365 

Asn Leu 

Thr Glu 

Leu Thr 

Ser Val 
430 
He Leu 
445 

Glu Arg 

Asn Val 

Asp Val 

Pro Arg 
510 
Val Gin 
525 

Lys His 

He Leu 

Leu Tyr 

Asn Lys 
590 
Lys Leu 
605 

Tyr Lys 
Phe Ser 



Lys Arg 

Gin Gin 

Phe Asn 
240 
Lys Trp 
255 

Asp Arg 

Pro Gin 

He Thr 

Leu Lys 
320 
Asp Thr 
335 

Thr Tyr 

Leu Phe 

Met Val 

Glu His 
400 
Thr Thr 
415 

Glu Trp 

Gly Arg 

Lys His 

Asn Glu 
480 
Trp Lys 
495 

Pro Leu 

Tyr Ala 

Leu He 

Arg Ala 
560 
Gly Glu 
575 

Ser Asp 

Leu Tyr 

Ser Lys 

Tyr Glu 
640 



Ser Ala Tyr lie His Tyr Thr Phe Met Gin Lys Thr Leu Val Asn Lys 

645 650 655 

Lys Ser lie Ser Cys Thr Thr His lie Ala Arg lie Ser Val Cys g 

660 665 670 

Leu Tyr Phe Thr lie Cys Tyr Leu Met Asn Met Phe His Val Val Val 

675 680 685 

Ala Lys Lys Ser Arg Arg Gly Arg Glu 
690 695 

<210> 13 
<211> 1649 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Nucleotide sequence of the amorphadiene synthase encoding gene, 
between start and stop codon, obtained by PGR with primers C 
and D 

<400> 13 



ccatggcact 


tacagaagaa 


aaacctattc 


gccccattgc 


caactttcct 


ccaagcattt 


60 


ggggagatca 


gtttctcatc 


tatcaaaagc 


aagtagagca 


aggggtggaa 


cagatagtga 


120 


atgatttaaa 


aaaagaagtg 


cggcaactac 


taaaagaagc 


tttggatatt 


cctatgaaac 


180 


atgccaattt 


gttgaagctg 


attgatgaaa 


ttcaacgcct 


tggaataccg 


tatcactttg 


240 


aacgggagat 


tgatcatgca 


ttgcaatgta 


tttatgaaac 


atatggtgat 


aactggaatg 


300 














~j u yj 


atgttttcaa 


taactataaa 


gacaaaaatg 


gagcgttcaa 


gcaatcgtta 


gctaatgatg 


420 


ttgaaggttt 


gcttgagttg 


tacgaagcaa 


cttctatgag 


ggtacctggg 


gagattatat 


480 


tagaagatgc 


tcttggtttt 


acacgatctc 


gtcttagcat 


tatgacaaaa 


gatgcttttt 


540 


ctacaaaccc 


cgctcttttt 


accgaaatac 


aacgggcact 


aaagcaaccc 


ctttggaaaa 


600 


ggttgccaag 


aatagaggcg 


gcgcagtaca 


ttcctttcta 


tcaacaacaa 


gattctcata 


660 


acaagacttt 


acttaaactt 


gctaagttag 


agttcaattt 


gcttcagtca 


ttgcacaagg 


720 


aagagctcag 


ccatgtgtgc 


aaatggtgga 


aagctttcga 


tatcaagaag 


aacgcacctt 


780 


gtttaagaga 


tagaattgtt 


gaatgctact 


tttggggact 


aggttcaggc 


tatgagccac 


840 


agtattcccg 


ggctagagtt 


ttcttcacaa 


aagctgttgc 


tgttataact 


cttatagatg 


900 


acacttatga 


tgcgtatggt 


acttatgaag 


aacttaagat 


ctttactgaa 


gctgttgaaa 


960 


ggtggtcaat 


tacatgctta 


gacacacttc 


cagaatacat 


gaaaccgata 


tacaaattat 


1020 


tcatggatac 


atacacagaa 


atggaagaat 


ttcttgcaaa 


ggagggaaga 


acagatctat 


1080 


ttaactgcgg 


caaagaattt 


gtgaaagagt 


ttgttagaaa 


cctgatggtt 


gaagcaaaat 


1140 



gggcaaatga gggacacata ccaaccactg aagagcatga tccagttgta atcattactg 1200 

gcggtgctaa cctgcttaca acaacttgtt atcttggcat gagtgatata ttcacaaaag 1260 

agtctgtcga atgggctgtc tctgcacctc ctctttttag atactcaggt atacttggtc 1320 

gacgcctaaa tgatctcatg acccacaagg ccgagcaaga aagaaaacat agttcatcga 1380 

gccttgaaag ttatatgaag gaatataatg tcaatgagga gtatgcccaa accttgattt 1440 

acaaggaagt agaagatgtg tggaaagata taaaccgaga gtacctcaca actaaaaaca 1500 

ttccaaggcc gttattgatg gctgtgatct atttgtgcca gtttcttgaa gttcaatatg 1560 

caggaaagga taacttcaca cgtatgggag acgaatacaa acatctcata aagtctctac 1620 

tcgtttatcc tatgagtata tgaggatcc 1649 

<210> 14 
<211> 549 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Deduced amino acid sequence of the amorphadiene synthase 

encoding gene, between start and stop codon, obtained by PGR 
with primers C and D 

<400> 14 

Thr Met Ala Leu Thr Glu Glu Lys Pro lie Arg Pro lie Ala Asn Phe 

5 10 15 

Pro Pro Ser lie Trp Gly Asp Gin Phe Leu lie Tyr Gin Lys Gin Val 

20 25 30 

Glu Gin Gly Val Glu Gin lie Val Asn Asp Leu Lys Lys Glu Val Arg 

35 40 45 

Gin Leu Leu Lys Glu Ala Leu Asp lie Pro Met Lys His Ala Asn Leu 

50 55 60 

Leu Lys Leu lie Asp Glu lie Gin Arg Leu Gly lie Pro Tyr His Phe 
65 70 75 80 

Glu Arg Glu lie Asp His Ala Leu Gin Cys lie Tyr Glu Thr Tyr Gly 

85 90 95 

Asp Asn Trp Asn Gly Asp Arg Ser Ser Leu Trp Phe Arg Leu Met Arg 

100 105 110 

Lys Gin Gly Tyr Tyr Val Thr Cys Asp Val Phe Asn Asn Tyr Lys Asp 

115 120 125 

Lys Asn Gly Ala Phe Lys Gin Ser Leu Ala Asn Asp Val Glu Gly Leu 

130 135 140 

Leu Glu Leu Tyr Glu Ala Thr Ser Met Arg Val Pro Gly Glu He He 
145 150 155 160 

Leu Glu Asp Ala Leu Gly Phe Thr Arg Ser Arg Leu Ser He Met Thr 

165 170 175 

Lys Asp Ala Phe Ser Thr Asn Pro Ala Leu Phe Thr Glu He Gin Arg 

180 185 190 

Ala Leu Lys Gin Pro Leu Trp Lys Arg Leu Pro Arg He Glu Ala Ala 

195 200 205 

Gin Tyr He Pro Phe Tyr Gin Gin Gin Asp Ser His Asn Lys Thr Leu 



210 
Leu Lys 
225 

Glu Glu 

Lys Asn 

Gly Leu 

Phe Thr 
290 
Ala Tyr 
305 

Arg Trp 

He Tyr 

Ala Lys 

Lys Glu 
370 
Gly His 
385 

Gly Gly 

He Phe 

Phe Arg 

His Lys 
450 
Tyr Met 
465 

Tyr Lys 

Thr Thr 

Cys Gin 

Met Gly 
530 
Met Ser 
545 



Leu 

Leu 

Ala 

Gly 
275 
Lys 

Gly 

Ser 

Lys 

Glu 
355 
Phe 

He 

Ala 

Thr 

Tyr 
435 
Ala 

Lys 

Glu 

Lys 

Phe 
515 
Asp 

He 



Ala 

Ser 

Pro 
260 
Ser 

Ala 

Thr 

He 

Leu 
340 
Gly 

Val 

Pro 

Asn 

Lys 
420 
Ser 

Glu 

Glu 

Val 

Asn 
500 
Leu 

Glu 

Gly 



Lys 

His 
245 
Cys 

Gly 

Val 

Tyr 

Thr 
325 
Phe 

Arg 

Arg 

Thr 

Leu 
405 
Glu 

Gly 

Gin 

Tyr 

Glu 
485 
He 

Glu 

Tyr 

Ser 



215 
Leu Glu 
230 

Val Cys 

Leu Arg 

Tyr Glu 

Ala Val 
295 
Glu Glu 
310 

Cys Leu 

Met Asp 

Thr Asp 

Asn Leu 
375 
Thr Glu 
390 

Leu Thr 

Ser Val 

He Leu 

Glu Arg 
455 
Asn Val 
470 

Asp Val 

Pro Arg 

Val Gin 

Lys His 
535 



Phe Asn 

Lys Trp 

Asp Arg 
265 
Pro Gin 
280 

He Thr 

Leu Lys 

Asp Thr 

Thr Tyr 
345 
Leu Phe 
360 

Met Val 

Glu His 

Thr Thr 

Glu Trp 
425 
Gly Arg 
440 

Lys His 

Asn Glu 

Trp Lys 

Pro Leu 
505 
Tyr Ala 
520 

Leu He 



Leu Leu 
235 
Trp Lys 
250 

He Val 

Tyr Ser 

Leu He 

He Phe 
315 
Leu Pro 
330 

Thr Glu 

Asn Cys 

Glu Ala 

Asp Pro 
395 
Cys Tyr 
410 

Ala Val 

Arg Leu 

Ser Ser 

Glu Tyr 
475 
Asp He 
490 

Leu Met 
Gly Lys 
Lys Ser 



220 

Gin Ser Leu 

Ala Phe Asp 

Glu Cys Tyr 
270 

Arg Ala Arg 
285 

Asp Asp Thr 
300 

Thr Glu Ala 

Glu Tyr Met 

Met Glu Glu 
350 

Gly Lys Glu 
365 

Lys Trp Ala 
380 

Val Val He 

Leu Gly Met 

Ser Ala Pro 
430 

Asn Asp Leu 
445 

Ser Ser Leu 
460 

Ala Gin Thr 

Asn Arg Glu 

Ala Val He 
510 

Asp Asn Phe 

525 
Leu Leu Val 
540 



His 

He 
255 
Phe 

Val 

Tyr 

Val 

Lys 
335 
Phe 

Phe 

Asn 

He 

Ser 
415 
Pro 

Met 

Glu 

Leu 

Tyr 
495 
Tyr 

Thr 

Tyr 



Lys 
240 
Lys 

Trp 

Phe 

Asp 

Glu 
320 
Pro 

Leu 

Val 

Glu 

Thr 
400 
Asp 

Leu 

Thr 

Ser 

He 
480 
Leu 

Leu 

Arg 

Pro 



